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Abstract 



We derive new perturbation bounds for eigenvalues of Hermitian matrices with block struc- 
tures. The structures we consider range from a standard 2-by-2 block form to block tridiagonal 
and tridigaonal forms. The main idea is the observation that an eigenvalue is insensitive to 
componentwise perturbations if the corresponding eigenvector components are small. We show 
that the same idea can be used to explain two well-known phenomena, one concerning extremal 
eigenvalues of Wilkinson's matrices and another concerning the efficiency of aggressive early 
deflation applied to the symmetric tridiagonal QR algorithm. 

Keywords: Eigenvalue perturbation, block Hermitian matrix, Wilkinson's matrix, aggressive early 
deflation 

1 Introduction 

Theory of eigenvalues of Hermitian matrices is a well-studied subject, with many aesthetically 
pleasing results available, such as the max-min characterization, Cauchy's interlacing theorem and 
Weyl's theorem [121 Ch.4] [3 Ch.8], [H Ch.4]. Here we are concerned with eigenvalue perturbation 
bounds, and first note that for an eigenvalue A of a given unstructured Hermitian matrix A and a 
general E such that ||-E||2 is bounded by a known constant, Weyl's theorem gives the best possible 
bound that is attainable. 

The goal of this paper is to specialize in block Hermitian matrices and derive perturbation 
bounds of the eigenvalues that are sharper than general bounds, such as Weyl's theorem. 

Much work has been done in this direction as well. For example, a well-known quadratic residual 
bound exists [10^ [8] that relates the eigenvalues of two 2-by-2 block Hermitian matrices of the form 



A 



A x 
A 2 



and A 



E 



E H ' 

A 2 




if \i(A) ^ A(.Ai) is the ith eigenvalue of A then 




(2) 
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Here, Xi(X) denotes the ith smallest eigenvalue of a Hermitian matrix X and X(X) denotes the set 
of X's eigenvalues. 

When \\E\\2 is small, ([2]) provides much tighter bounds than linear eigenvalue perturbation 
bounds do, such as the well-known Weyl's theorem jl], which gives |Aj(A) — Aj(.A)| < ||-E||2- We 
note that [7] gives an improved bound that is always sharper than both the bound by Weyl's 
theorem and ([2]). 

The contribution of this paper is that we present a new framework for deriving new eigenvalue 
perturbation bounds, which is to first obtain bounds for the relevant eigenvector components and 
use them to arrive at bounds of eigenvalues. Based on this framework we give new bounds for the 
2-by-2 block form as in ([I]), but unlike ([I]) we assume no zero submatrix. Specifically, we study the 
difference between eigenvalues of A and A + E, where 



A 



A u A T 21 
A21 A22 



and E 



En E21 
E21 E22 



(3) 



We then turn to the more specialized and well-studied tridiagonal cawse and show a tight bound 
results from our idea when a target eigenvalue is disjoint from many Gerschgorin disks. To demon- 
strate the sharpness of our approach, we show that our framework successfully explains the following 
two well-known phenomena: (i) Wilkinson's matrices have many pairs of nearly equal eigenvalues, 
(ii) Aggressive early deflation applied to the symmetric tridiagonal QR algorithm deflates many 
eigenvalues even when no sub diagonal element is negligibly small. 

The structure of this paper is as follows. In section [2] we survey and derive some necessary 
results, and present our basic idea . Section [3] treats the 2-by-2 block case and proves a new bound 
that is sometimes tighter than any known bound. Section [4] deals with the tridiagonal case, in 
which we investigate the above two case studies. 

Notations: Xi{X) denotes the ith smallest eigenvalue of a Hermitian matrix X. For simplicity 
we use Aj, \i{t) and Aj to denote the ith eigenvalue of A, A + tE and A + E for t € [0, 1] and A and 
E as defined in (J3j) respectively. \{A) denotes the set of the eigenvalues of a Hermitian matrix A. 
(Ti{B) denotes the ith largest singular value of a general matrix B. We only use the spectral norm 



2 Preliminaries 

In this section we review some results that will be necessary for our analysis. We first recall the 
partial derivative of simple eigenvalues [12] . 

Lemma 1 Suppose A and E are Hermitian as in (|3|). Denote by \i{t) the ith eigenvalue of A + tE 
such that (A + tE)x^\t) = \i(t)x {i \t) where ||x (i) (t)|| 2 = 1 for some t G [0,1]. If Xi(t) is simple, 

^ = x(t) H E X (t). (4) 
1 For simplicity hereafter we discard the superscript (i) in the eigenvector x^\i). 
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Since Aj(0) = \i(A) = Aj and Aj(l) = Xi(A + E) = Aj, from @ it follows that if \i{t) is simple for 
all < t < 1, then 



A,; — A; 



< 



x{t) H Ex(t)dt 
xi{t) H E u xi{t)dt 



+ 2 


L 







x 2 (t) H E 2lXl (t)dt 



+ 



x 2 {t) H E 22 xi{t)dt 



(5) 



(6) 



where we partitioned x(t) 



xi(t) 
x 2 {t) 



for X2(t) € C fc . The key observation here is that the latter two 



terms in ([6]) are small if ||x2(i)||2 ^ s sma h for all < t < 1. The next Lemma gives a useful upper 
bound for 1 1 ) 1 1 2 - 



Lemma 2 Suppose that Aj ^ A(^22) is the ith eigenvalue of A as defined in ([3]o Let Ax = XiX 

x 2 



such that \\x\\ 2 = 1. Then, denoting x 

\\x 2 \\ 2 < 



for x 2 € C k we have 
\\A21 2 



minj I A, - A J '(^22)| 
PROOF. The bottom k rows of Ax = Xx is 

A 21 xi + A 22 x 2 = XiX 2 , 

so we have 

x 2 = (Xil - A22) _1 A 2 lXi. 

Taking norms we get 

lla^lh < ||(Aii - -4.22 ) — 1 1 1 2 H ^4.21 1 1 2 1| ^ci 1 1 2 < 



(7) 



\A 



21 2 



min, |Aj - Xj(A 22 )\ 



Ixih < \\x\\ 2 = 1) 



□ 



We note that ([7]) is valid for any Aj and its eigenvector x, whether or not Aj is a multiple 
eigenvalue. Thus for the multiple case, all the vectors that span the corresponding eigenspace 
satisfies ©. 

However, Lemma [T] assumes that Aj is a simple eigenvalue of A to derive the partial derivative 
of Aj with respect to t. Special treatment is needed to get the derivative of multiple eigenvalues, 
and this is shown in the appendix. It turns out that everything that we discuss carries over, in that 
when Aj(t) is multiple, still holds for a certain eigenvector x(t) of Aj(i). We defer the treatment 
of multiple eigenvalues to the appendix, because they only cause complications to the analysis that 
are not fundamental to the eigenvalue behavior. Hence for simplicity we assume that Aj(t) is simple 
for all i, so that the normalized eigenvector is unique (up to a factor e ld ). 



2 If A ^ A(yln) we simply swap the subscripts 1 and 2 in the following arguments. 
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3 2-by-2 block case 



In this section we derive refined eigenvalue perturbation bounds by combining Lemmas [T] and [2J 

Consider the matrix A + tE and its ith eigenvalue Xi(t) such that (A + tE)x(t) = Xi(t)x(t). 
The key observation is that if mm,- |Aj — Xj(A 2 2)\ > H-E22II2 then ([7]) provides an upper bound for 
|M*)|| 2 for all t G [0, 1]: 



\X2 



< 



1^21 lb + II-S21II2 



minj |Aj - Xj(A 2 2)\ - 2\\E\\ 2 
This is verified simply by letting A A + tE in ([7]) , which gives 

\\A21 + tE 2 i\\ 2 



(=Ti), 0<t<l. 



(8) 



F2 



< 



minj |A,j(t) - Aj(A 22 + i-E 1 : 



22 J 



and using the facts that \\A21 + i-E21.il 2 < H-A21H2 + £||-E 2 i|| 2 an d min^ \Xi(t) — A.,(j4 22 + £-E 22 )| > 
mi nj |Ai(0) - Aj(A 22 )| - ||J5|| a - ||-E 22 || 2 for all < t < 1, because |A»(t) - Aj(0)| < t\\E\\ 2 by Weyl's 
theorem. 

Now we present our main result. 

Theorem 1 Let Aj,Aj 6e the ith eigenvalue of A and A + E as in ([3]) respectively, and define 

as in ([8]) . For each i, if n > then 



min.,- |Aj - Aj(A 22 )| - 2\\E\\ 2 

Xi — X 
into 



PROOF. Substituting 



< 1 1 -Ell || 2 + 2||£ 2 i \\Ti + \\E22 || 2 Tj 

we get 



(9) 









Aj — Xi 


< 


[ \\Eu\\ 






Jo 



\dt 



+ 2 


I 







\E- 



21 2 



\xi(t)\\ 2 \\x 2 (t)\\ 2 dt 



+ 



-E 22 | h| F2 



\dt 



< Il-Elllb + 2||-E 2 l|| 2 Tj + ||-E 22 || 2 Tj , 

which is ©. 

Two remarks are in order regarding the theorem. 



□ 



Theorem Q] is tighter than the Weyl bound \\E\\ 2 only when r, < 1, which is mm,- |Aj — 
Xj{A 22 )\ > ||-E 22 || 2 + H-A21H2 + ||-E 2 i|| 2 . If A is far from the spectrum of A 22 and ||-En|| 2 is 
small, then Theorem Q] is much tighter than Weyl's theorem. 

When A 2 \ and E\\ are zero but E 22 and E 2 \ are nonzero, ([9|) reveals that Xi is particularly 
insensitive to the perturbation E 22 : in ([9]), the term involving ||.E 22 || 2 becomes proportional 
to II-E22II2II-E21 Hi) which scales cubically with ||-E||2. 



For example, consider the n-by-n matrices 



and 



An s 
5 



where An is nonsingular. 



A n S 
6 e 

These matrices have n — 1 eigenvalues that match up to 0(e5 2 ), and one eigenvalue that 
matches up to e. 
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4 Tridiagonal case 



We now turn to the symmetric tridiagonal case, and suppose that 



A 



Ol 61 



b n -l 
b n -l 0-n 



and E 



ei A 
/1 ■■■ 



n-l 



fn-1 
£ 11 



(10) 



where we assume without loss of generality that 6j > for all i. Our aim is to show that we 
can derive tighter results based on the same idea as in the previous section. We demonstrate the 
sharpness of our approach by considering the two case studies: 

1. Explain why two largest eigenvalues of Wilkinson's matrices are nearly equal. 

2. Explain why aggressive early deflation can deflate eigenvalues as "converged" when applied 
to the symmetric tridiagonal QR algorithm. 



4.1 Basic idea 

When A and E are symmetric tridiagonal as defined in (|10p . one can of course use Theorem Q] to get 
eigenvalue perturbation bounds. However the tridiagonal structure enables us to refine the bound 
in Lemma [U which in turn yields tighter eigenvalues bounds. 

Specifically, for symmetric tridiagonal A, the kih. row of Ax = Xx is 

+ a k x k + b k x k+ i = Xx k , 

so we have 



|A - a k \\x k \ = \b k -ix k -! + b k x k+ i\ < (|&fe-i| + |6fc|)max(|x fc _i|, |x fc+ i|). 
Therefore if |A — a k \ > \b k ^i\ + \b k \ then \x k \ < max(|xfc_i|, |:Efc+i|) and 

I 1 , |&fc-l| + \h\ n || h 

\x k \ < — r-r 1— max \x k -x , \x k +i \). 

\X - a k \ 

Note that the condition |A — a k \ > 1| + \b k \ implies that the disjointness of a Gerschgorin disk 
from an eigenvalue implies that the eigenvector components are decaying. This observation was 
made in [11] . 

Below we show how this idea can be used in practice. 
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4.2 Eigenvalues of Wilkinson's matrix 

The well-known Wilkinson's matrix [15], whose famous 2n + 1 = 21 case is 

"10 1 
1 9 '■. 



w. 



21 



1 

1 1 1 

1 1 

1 1 



1 

10 



(11) 



Such matrices are known to have many pairs of extremely close eigenvalues: for example, the two 
largest eigenvalues agree up to about 7 x 10 -14 . [151 P-308] notes that in general the two largest 
eigenvalues of the matrix W 2 + n+1 agree up to roughly (n!) -2 , but does not explain this in detail. We 
shall give an explanation using the ideas we described in this paper. Define (2n + l)-by-(2n + 1) 
matrices A and E such that A + E = W%\ by 



10 1 
1 9 



.4 



1 

1 

1 

1 



1 

10 



E 



1 





(12) 



Note that A has 10 (in general n) pairs of multiple eigenvalues with multiplicity 2, and a simple 
eigenvalue 0. We show that the large eigenvalues of A (those close to 10) are extremely insensitive 
to the perturbation E, so that the two largest eigenvalues of A + E must be very close to those of 
A, hence close to each other. 

Below we suppose n > 4. First we consider the largest eigenvalue of A, which we denote by 
A(> n). Define x(t) = [x\{t) x 2 (t) ■ ■ ■ X2 n +i{t)} such that (A + tE)x(t) = X(t)x(t) where X(t) is a 
continuous function of t with A(0) = A. 

Let us recall Lemma [2] and consider refining the bound for |a; n (t)|, |a; n +i(t)| and |x n+ 2(i)|. Since 
A is tridiagonal, from the (n + l)th row of (A + tE)x(t) = \(t)x(t) we have 



X(t)x n+ i(t) = t(x n (t) + x n+2 (t)), 



hence 



\x n +l(t)\ < 



2tmax(\x n (t)\, \x n+2 (t)\) 



t € [0,1]. 



(13) 
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First we consider the (t)\ > \x n+2 (t)\, in which case we also have |x n (i)| > |x n +i(t)| in view 

of (j!3|) . From the (n — l)th row of (A + tE)x(t) = X(t)x(t) we similarly get 

|*n-!(t)l < fez^±i^ ; te[0jl] . (14 ) 

Now since n < X(t) < n + 1 for all t € [0, ljj we must have |x n _2(t)| > |:r n _i(i)| > |x n (i)|. 

Substituting this into (|14p yields |x n _i ( t )| < *l^-2(*)l , Therefore we have 

A(t) — 1 — t 

I m , ^ t\x n - 2 (t)\ \x n _ 2 (t)\ + ^r nl i 

By a similar argument we find that 

L_ iW |<^!dM<hzLlM fo r l <i < n -2, 0<t < 1, (15) 
n — i — 1 n — z — 1 



2t, 1 2iVr 2 



so together with (fT3|) we get 

i wo i < -n n — ^-r ^ - n — L —{ 

?2 -*- x n — ? — 1 n ■ LJ - n — i — 1 

i=i i=i 

and 

n-2 ^ 

|x»(t)|<*n — — r ( 17 ) 

- LA n — i — 1 

When n = 10 we have 5q < 5\ < 5 x 10~ 5 . which we now plug into Q to get 



\\i(A + E)-\i(A)\ < 



x(t) H Ex(t)dt 



< I 2{\x n {t)\ + \x n+2 (t)\)\x n+1 {t)\dt 

n-2 \ 2 i 



< - 

n 



4 fe 2 1 



s Wrn ■ < 18 > 



,i=l 



The case |x n _i(t)| < |cc n +i(i)| can also be treated similarly, and we get the same result. 

We easily appreciate that the bound (fT8l) roughly scales as l/n((n — 2)!) 2 as n — > oo, which 
supports the claim in [15]. 

We also note that by a similar argument we can prove that the 21 — 1th and 2£th eigenvalues 
of W 2n -l i match to within l/(n — I + l)((w — £ — 1)!) 2 > which is small for small but not as small 
for larger I. Since this is an accurate description of what is well known about the eigenvalues of 
W-jn+i, we conclude that this approach explains the observation that Wilkinson's matrix has many 
pairs of eigenvalues that are nearly equal. 



We can get n < \(t) < n + 1 by first following the same argument using n + 2 > A(t) > n — 1. 
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4.3 Aggressive early deflation applied to symmetric tridiagonal QR 

The aggressive early deflation strategy introduced in [3] for the nonsymmetric Hessenberg QR 
algorithm, is known to greatly speed up the algorithm for computing the eigenvalues of a nonsym- 
metric matrix by deflating converged eigenvalues much earlier than a conventional deflation strategy 
does. Here we consider the symmetric tridiagonal QR, for which a similar (though perhaps not as 
dramatic) performance improvement is expected. 

The following is a brief description of aggressive early deflation applied to the symmetric tridi- 
agonal QR. Let A a tridiagonal matrix as defined in (PTUj) . Denote by A 2 the lower-right k x k 
submatrix of A, and let A 2 = VDV T be an eigendecomposition, where the diagonals of D are in 
decreasing order of magnitude. Then, we have 



(19) 



where A\ is the upper-left {n—k) x (n—k) submatrix of A, and the vector t is given by t = b n -kV(l, '•) 
where V(l, :) is the first row of V. It often happens that many elements of t are small, in which 
case aggressive early deflation regards D's corresponding eigenvalues as converged and deflate them. 
This is the case even when none of the subdiagonals of A is particularly small. 

'Ai 
A 2j 

almost equal, or equivalently that the perturbation of the eigenvalues by the (n — k)th subdiagonal 
6 n _fc is negligible. Here our aim is to explain why this is often the case. We do this by showing 



I 


T 

A 


'I 




' A, 


t T 


V 


V 




t 


D 



This means that many eigenvalues of the two matrices A = A + E and A 



arc 



that under an assumption that is typically valid of a tridiagonal matrix appearing in the course of 
the QR algorithm, many eigenvalues of A 2 are perturbed only negligibly by b n _ k . Again we rely 
on our idea of bounding relevant eigenvector components. 

It is well-known that under mild assumptions the tridiagonal QR algorithm converges, in that 
the diagonals converge to the eigenvalues in descending order of magnitude, and the subdiagonal 
elements converge to zero |14| . In light of this, here we assume that the diagonals en are roughly 
ordered in descending order of their magnitudes, and that the subdiagonals bi are small, so that 
for a target (small) eigenvalue X(A) of A 2 , we have |aj — A| > bi + alpha for 1 < i < n — k + j for 
some j > 0. Here, a is a bound such that |A(i) — A| < a for all < t < 10- 

We bound the perturbation on A by 6 n _ s by tracing A via (j4]). Specifically, defining a continuous 
function \(t) of t such that (A+tE)x(t) = X(t)x(t) where A(0) = X and x(t) = [x\(t) x 2 (t) • • • x n (t)] 
is a unit vector for all t, we shall bound |A(1) — A (0) | . We shall prove the following. 

Proposition 4.1 Under the above notations and assumptions, 



|A(l)-A(0)|<^ n _ fe n^_ fe+i 



(20) 



i=l 



where r\i = — — — . 

\o-i — A| — a — bi 

PROOF. First note from ([4]) that 



A(1)-A(0)= / x n ^ k {t)x n ^ k+l {t)dt. 
Jo 



(21) 



4 We can safely let a — which works, but we can get a much smaller bound for example by using the argument 
here recursively 
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From the first row of (^4 + tE)x(t) = X(t)x(t) we have 

(a 1 -X(t))\x 1 (t)\ = b 1 \x 2 (t)\, 

Hence 



= bj_ 

\ x 2(t)\ ai — X(t) ^ \ai — A| — a 



< 



< 1, 



Next, assuming < for some integer i < n — k + j, from the ith row of (A + tE)x(t) 

X(t)x(t) we have 

a k - X(t) - b k -i ^ fc ~^f. ) = b k \x k+ i(t)\, 



\ x k(t)\ 



so it follows that 



h 



\xk(t)\ 

\xk+i(t)\ ~ ( 0fc _A(*)-6 fc _i%ff) 
h 



< 



lofc - AI - a - b k . 



\xk(t)\ 

< 1, 



where the last inequality follows from the assumption — A| > hi + a. Hence, we have shown for 
1 < i < n — k + j that \xi(t)\ < \x i+ i(t)\ for all < t < 1. 



Therefore, we have 



\x n - k (t)\ 

\x n -k+j{t)\ 



< 



n 

i=l 



'n—k+i 



a-n-k+i — A| — a — 6 n _fc + j_i 



Since this holds for all < t < 1, we conclude that 



A(l) - A = A(l) - A(0) = / x n _ k (t)x n _ k+1 (t)dt 

'0 



< 



1 

< - • 



%n—k \t) 



■K-n—k+j (^) 



Xn-k+l(t) 



■K-n—k+j (^) 



(It 



bl- k 



2 \a n - k — X\ — a — 



n 



bn—k+i 



, Wn-k+i — A I — a — b n _ k+ i- 



b n — k r\ n _ k t— 1- 2 

S 7) ILVn-k+i- 

i=l 



Now let us analyze the result. The bound (|20|) is a product of j numbers r]^_ k+i for 1 < i < j, 
where each rj n _ k+ i can be much smaller than 1 if A is a nearly diagonal matrix. Moreover, the 
above argument is valid as long as the assumption on A that it is far from the n — k + j diagonals 
of A is true. 
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Simple example To illustrat the result, let A be the 1000-by-1000 tridiagonal matrix 

1 1.11] 

A = tridiag ^ 100 999 . . 2 1 . (22) 
1 1 1 1 J 

Let k = 100, and let us target on an eigenvalue A of A2 that is smaller than 10 (there are at least 9 of 

them). For such A we can let j = 88. Since rji = " k+l — ; < —. < 

\a n -k+i — X\ — a — 1 100 - 1 - A| - 1 

— (where we used the safe bound a = 1), Proposition 14.11 gives a bound 



2(90-1) f = \ (|90 -i\ - l) 2 
<1.7 x 10~ 271 . 

This shows that all the eigenvalues of A2 that are smaller than 10 can be hardly perturbed by the 
subdiagonal 6 n _fe (the same argument shows that more than 80 eigenvalues of A2 can be regarded 
as converged to within accuracy 10 -16 ). 



A Multiple eigenvalues 

In the text we assumed that all the eigenvalues of A + tE are simple for all < t < 1. Here we 
treat the case where multiple eigenvalues exist, and show that the results we proved hold exactly 
the same. 

We note that in [TJ [2] it is claimed that A + tE can only have multiple eigenvalues on a set of t 
of measure zero, and hence ^ can be integrated on t such that A + tE has only simple eigenvalues. 
However we cannot use this argument, which can be seen by a simple counterexample A = E = I , 
for which A + tE has a multiple eigenvalue for all < t < 1. Hence we need a different approach. 



A.l Muliple eigenvalue first-order perturbation expansion 



First we review a known result on multiple eigenvalue first-order perturbation expansion |13|. I9l 16]. 
Suppose that a Hermitian matrix A has a multiple eigenvalue Ao of multiplicity r, such that there 
exists a unitary matrix Q = (Q±, Q2) such that 



Q H AQ 



X I 
A 



(23) 



where A is a diagonal matrix that contains eigenvalues that are not Ao- Then, the matrix A + eE 
has eigenvalues Ai, A2, . . . , X r admitting the first-order expansion 

A l = A +/i l (Qf^Qi)e + o(e), i = 1, 2, . . . , r, (24) 

where ^{(Y^ 1 EX±) denotes the ith eigenvalue of the r-by-r matrix Q^EQ\. 

Using (|24p , we obtain the partial derivative corresponding to ^ when A + tE has a multiple 
eigenvalue \i(t) = Aj + i(i) = • • • = Aj +r _i(t) of multiplicity r: 



i +j-l 

dt 



(t) 



^(QritfEQ^t)), l<j<r. 



(25) 
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Now, let Qi(t) EQi(t) = U H AU be the eigendecomposition where the diagonals of 
A = diaginjiQ^t^EQ^t))) are in descending order. Then A = UQi{t)EQi(t)U H = Qi(t)EQi(t), 
where Qi(t) = Q\(t)U H , so Hj{Q\{t) H ' EQ\{t)) = qj{t) H Eqj(t), where qj(t) denotes the jth column 
of Q\(t). Now, since any vector of the form Qi(t)v is an eigenvector corresponding to the eigenvalue 
Aj(t), so is Oj(t). Hence we can still write the first-order perturbation expansion of each eigenvalue 
as in ((H), and since Lemma [2] holds regardless of whether A, is a multiple eigenvalue or not, we 
conclude that all the results in the text hold exactly the same when multiple eigenvalues exist. 



A. 2 Note on the trailing term 

Here we claim that the expansion can be made sharper in that the trailing term can be 0(e 2 

En E21 



instead of o(e) as in the known result (J23J). To see this, we write E 
that 

Q H (A + E)Q 



E 2 i E22 



, and see in ([23 



Xol + Q l EnQi Q l E21Q2 
Q2 E 2 iQi A + Q 2 E22Q2 



For sufficiently small E there is a positive gap in the spectrums of the matrices XqI + Q^EnQi 
and A + Q% Hence, using the quadratic eigenvalue perturbation bounds in [7] we see that 

the ith eigenvalue of Q H (A + E)Q and those of + Q i EnQl 
by 



2\\E\\l 



differ at most 



gap + 1/ gap 2 + ijji^jj! 
be replaced by 



A + Q%E 22 Q2_ 

. This is of size 0(e 2 ) because gap > 0. Therefore we conclude (I24p can 



\i = \ + ^(Q?EQi)e + O{e< 



1,2,. ..,r, 



(26) 
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